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Abstract 

Let G(n, m) be the random graph on n vertices with m edges. Let d = 2m jn be its average degree. 
We prove that Gin, m) fails to be fc-colorable w.h.p. if d > 2k In k — In k — 1 + Ok (1). This matches a 
conjecture put forward on the basis of sophisticated but non-rigorous statistical physics ideas (Krzakala, 
Pagnani, Weigt: Phys. Rev. E 70 (2004)). The proof is based on applying the first moment method to 
the number of "covers", a physics-inspired concept. By comparison, a standard first moment over the 
number of fc-colorings shows that G(n, m) is not fc-colorable w.h.p. if d > 2fc In k — In fc. 
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1 Introduction 

Let G(n, m) be the random graph on V — {1, . . . , n} with m edges. Unless specified otherwise, we let 
m = \dn/2~\ for a number d > that remains fixed as n — > oo. Let k > 3 be an n-independent integer. 
We say that G(n, m) has a property £ with high probability ('w.h.p.') //'lini„^ 00 P [G(n, m) £ £] = 1. 

One of the longest-standing open problems in the theory of random graphs is whether there is a phase 
transition for fc-colorability in G(n, m) and, if so, at what average degree d it occurs 0][T0][T7). Regarding 
existence, Achlioptas and Friedgut 0] proved that for any k > 3 there is a sharp threshold sequence 
dk-coi(n) such that for any fixed e > the random graph G(n, m) is fc-colorable w.h.p. if m/n < (1 — 
e)dfe_ co i(n) and non- fc-colorable w.h.p. if m/n > (l + e)d k - co i(n). To establish the existence of an actual 
sharp threshold, one would have to show that the sequence d k - C o\{n) converges. This is widely conjectured 
to be the case (explicitly so in Q) but as yet unproven. 

In any case, the techniques used to prove the existence of dk- co \{n) shed no light on its location. An 
upper bound is easily obtained via the first moment method. Indeed, a simple calculation shows that for 
fc > 3 and 

d > rffc,first = 2fc In k — In k, (1) 

the expected number number of fc-colorings tends to as n — >• oo (e.g., J4|). Hence, Markov's inequality 
implies that G(n, m) fails to be fc-colorable for d > dkfirst w.h.p. Furthermore, Achlioptas and Naor [6 1 
used the second moment method to prove that for any fc > 3, G(n, m) is fc-colorable w.h.p. if 

<4-col > 4,AN = 2(fc - l)ln(fc - 1) = 2fclnfc - 21nfc - 2 + o fe (l). (2) 

Here and throughout the paper, we use the symbol Ofc(l) to hide terms that tend to zero for large fc. The 
bound (0 was recently improved JT2), also via a second moment argument, for sufficiently large fc to 

dfc-col > rffc,socond = 2fclnfc - lnfc - 21n2 + o fe (l). (3) 
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This leaves an additive gap of 2 In 2 + Ok (1) between the upper bound (Q]i and the lower bound (01. 

The problem of fc-coloring G(n, m) is closely related to the "diluted mean-field fc-spin Potts antiferro- 
magnet" model of statistical physics. Indeed, over the past decade physicists have developed sophisticated, 
albeit mathematically non-rigorous formalisms for identifying phase transitions in random discrete struc- 
tures, the "replica method" and the "cavity method" (see 11311 for details and references). Applied to the 
problem of fc-coloring G(n, m) Il27l[33l[34ll38l . these techniques lead to the conjecture that 

dfe_ co i = 2fclnfc - lnfc - 1 + o fc (l). (4) 

The main result of the present paper is an improved upper on dk- C o\ that matches the physics prediction dU 
(at least up to the term hidden in the Ofc(l)). 

Theorem 1.1 We have eifc-coi < 2fc In k — In fc — 1 + 0^(1). 

Theorem 11.11 improves the naive first moment bound (Q3 by about an additive 1. This proves, perhaps 
surprisingly, that the fc-colorability threshold (if it exists) does not coincide with the first moment bound. 
Furthermore, Theorem II . 1 I narrows the gap to the lower bound <j3j to 2 In 2 — 1 + 0^(1) w 0.39. 

The proof of Theorem ll.ll is based on a concept borrowd from the "cavity method", namely the notion 
of covers. This concept is closely related to hypotheses on the "geometry" of the set of fc-colorings of 
the random graph, which are at the core of the cavity method [38 , 31 26, 27] [33] 137). More precisely, let 
Sk(G(n, m)) C {1, . . . , fc}™ be the set of all fc-colorings of G(n, m). According to the cavity method, for 
average degrees (1 + Ofc(l))fc In fc < d < dk-co\ w.h.p. the set Sk(G(n, m)) has a decomposition 

JV 

S k (G(n,m)) = \JCi 

i=l 

into N = cxp(f2(n)) non-empty "clusters" d such that for any two colorings er, r that belong to distinct 
clusters we have 

dist(cr, t) = \{v £ V : <r(v) ^ t(w)}| > 5n for some S = 8{k 1 d) > 0. 

In other words, the clusters are well-separated. Furthermore, a "typical" cluster d is characterized by a set 
of tt(n) "frozen" vertices, which have the same color in all colorings a £ C;. Roughly speaking, a cover is 
a representation of a cluster Cf. the cover details the colors of all the frozen vertices, while the non-frozen 
ones are represented by the "joker color" 0. We will define covers precisely in Section[3] 

The key idea behind the proof of Theorem 11.11 is to apply the first moment method to the number 
of covers. Since, according to the cavity method, covers are in one-to-one correspondence with clusters, 
we carry effectively out a first moment argument for the number of clusters. The improvement over the 
"classical" first moment bound for the number of fc-colorings results because this approach allows us to 
completely ignore the cluster sizes \d\. Indeed, close to the fc-colorability threshold the cluster sizes are 
conjectured to vary wildly, as has in part been established rigorously in lfT2l . By contrast, the "classical" 
first moment argument amounts to putting a rather generous uniform bound on all the cluster sizes. 

The "freezing" of vertices in fc-colorings of G(n,m) has been studied previously Q. Formally, let 
us call a set F of vertices 5-frozen in a fc-coloring a of G(n, m) if any other fc-coloring r such that 
t(v) <t(v) for some vertex v £ F indeed satisfies \{v £ F : a(v) ^ t(u)}| > Sn. There is an explicitly 
known sharp threshold dk, freeze = (1 + Ofc(l))fclnfc, about half of dk-coh sucn that for d > dkjreeze 
w.h.p. a random fc-coloring of G(n,m) has fi(n) frozen vertices 11321 . The threshold defreeze coincides 
asymptotically with the largest average degree for which efficient algorithms are known to find a fc-coloring 
of G(n, m) w.h.p. [3 , 2 1 1. In fact, it has been hypothesized that the emergence of frozen vertices causes 
the failure of a wide class of "local search" algorithms i2] |32l . 

Yet the known results |2] [32) on the freezing phenomenon only show that a random fc-coloring of 
G(n, m) "freezes". It is not apparent that this poses an obstacle if we merely aim to find some fc-coloring. 
As an important part of the proof of Theorem ll.il we show that for d close to dfc_ co i (but strictly below the 
lower bound (0), in fact all fc-colorings of G(n, m) belong to a cluster with many frozen vertices w.h.p. 
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Corollary 1.2 Assume that d > 2fc In fc — In fc — 4 + Ofc(l). There is a number 6k > rac/i f/ta? w.h.p. 
every k-coloring a of the random graph G(n,m) has a set F(o~) of 8k- frozen vertices of size \F(cr)\ > 
(l-o fe (l))n. 

Due to the (conjectured) relationship between freezing and the demise of local-search algorithms, it 
would be interesting to identify the precise threshold where all the fc-colorings of G(n, m) are frozen. 

Further related work. The problem of coloring G(n, m) has been studied intensively over the past few 
decades. Improving a prior result by Matula fl30l . Bollobas (9) determined the asymptotic value of the 
chromatic number of dense random graphs. Luczak extended this result to sparse random graphs ll28l . In 
the case that d remains fixed asn^ oo, his result yields dk- C o\ = (2 + Ok(l))k In k. As mentioned above, 
Achlioptas and Naor [6 | improved this result by obtaining the lower bound ©. In addition, Luczak's result 
was sharpened in ifTTI for m <C n 5 / 4 . 

The problem of locating the threshold for 3-colorability has received considerable attention as well. The 
best current lower bound is 4.03 [Q. Moreover, Dubois and Mandler lfl4l proved that d3„ co i < 4.9364. 
This improved over a stream of prior results [2] Q3] [18] |20] |24] . 

The key idea in this line of work is to estimate the first moment of the number of "rigid" colorings: 
for any two colors 1 < i < j < k, every vertex of color i must have neighbors of color j J4] . Clearly, 
any fc-colorable graph must have a rigid fc-coloring. At the same time, the number of rigid fc-colorings can 
be expected to be significantly smaller than the total number of fc-colorings, and thus one might expect 
an improved first-moment upper bound. However, in terms of the clustering scenario put forward by 
physicists, it is conceivable that many clusters contain a large (in fact, exponentially large) number of 
rigid fc-colorings. Therefore, the idea of counting rigid fc-colorings seems conceptually weaker than the 
approach of counting clusters pursued in the present work. In fact, the improvement obtained by counting 
rigid colorings appears to diminish for larger fc 0). 

A fairly new approach to obtaining upper bounds on thresholds in random constraint satisfaction prob- 
lems is the use of the interpolation method JU [19] [22] [35) . This technique gives an upper bound on, e.g., 
the fc-colorability threshold in terms of a variational problem that is related to the statistical mechanics 
techniques. However, this variational problem appears to be difficult to solve. Thus, it is not clear (to me) 
how an explicit upper bound as stated in Theorem 1 1.1 l ean be obtained from the interpolation method. 

Dani, Moore and Olson I1T31 studied a variant of the graph coloring problem in which each pair of (u, v) 
of vertices comes with a random permutation ir u tV of the fc possible colors; this gives rise to a concept of 
"permuted" fc-colorings. They obtained an upper bound of 2fclnfc — lnfc — 1 + 0^(1) on the threshold 
for the existence of permuted fc-colorings. The proof is based on counting the total weight of fc-colorings 
and using an isoperimetric inequality. Moreover, as pointed out in |fT3l . physics intuition suggests that the 
threshold in the permuted fc-coloring problem matches the "unpermuted" fc-colorability threshold. 

In the context of satisfiability, Maneva and Sinclair l29l used the concept of covers to obtain a condi- 
tional upper bound on the random 3-S AT threshold. Roughly speaking, the condition that they need is that 
w.h.p. all satisfying assignments have frozen variables. However, verifying this condition in random 3-SAT 
is an open problem. (That said, it is conceivable that the approach used in ||29l might yield a better upper 
bound on the fc-SAT threshold for large fc.) 

2 Preliminaries 

Let [fc] = {1, 2, . . . , fc}. Because Theorem l 1 . 1 l and Corollary II. 2 l are asymptotic statements in both fc and n, 
we may generally assume that fc > fco and n > uq, where fco, no are constants that are chosen sufficiently 
large for the various estimates to hold. 

We perform asymptotic considerations with respect to both fc and n. When referring to asymptotics in 
fc, we use the notation Ok(-), Ok(-), etc. Asymptotics with respect to n are just denoted by O(-), o(-), etc. 

If G is a (multi-)graph and A, B are sets of vertices, then we let ec(A, B) denote the number of A-B- 
edges in G. Moreover, ec{A) denotes the number of edges inside of A. If A = {v} is a singleton, we just 
write ea(v, B). The reference to G is omitted where it is clear from the context. 
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Working with independent edges. The random graph G(n, m) consists of m edges that are chosen 
almost independently. To simplify some of the arguments below, we are going to work with a random multi- 
graph model G'(n, m) in which edges are perfectly independent. More precisely, G"(n, m) is obtained as 
follows: let e = (ei, . . . , e m ) G (V x V) m be a uniformly random m-tuple of ordered pairs of vertices. In 
other words, each is chosen uniformly out of all n 2 possible vertex pairs, independently of all the others. 
Now, let G'(n, m) be the random multi-graph comprising of ex, ■ ■ ■ , e m viewed as undirected edges. Thus, 
G'(n, m) may have self-loops (if = (v, v) for some index i) as well as multiple edges (if, for example, 
&i = (u, v) and e 3 - = (v, u) with 1 < i < j < m and u ^ v). The two random graph models are related 
as follows. 



Lemma 2.1 For any event A we have P [G(n, m) G A] < 0(1) • P [G'(n, m) G A]. 

Proof. The random graph G'(n, m) has at most m distinct edges, and no self-loops. Let £ be the event that 
it has exactly m edges. This is the case iff ei, . . . , e m induce pairwise distinct undirected edges. Given the 
event £, G'(n 1 m) is identical to G(n, m). Hence, 



P [G(n, m) e -4] = P [G'(n, m) e A\£] < P [G'(n, m) € A] /P [£} 



(5) 



Now, 



p[f] > ni 1 



i=0 



2i + n 



Thus, the assertion follows from (0. 



exp 



Ml - (2i + n)/n 2 



i=0 



> exp(-d- 2d 2 ) = 



□ 



The Chernoff bound. We need the following Chernoff bound on the tails of a binomially distributed 
random variable (e.g., j23] p. 21]). 

Lemma 2.2 Let ip(x) = (1 + x) ln(l + x) — x. Let X be a binomial random variable with mean /i > 0. 
Then for any t > we have 

P[X >E[X]+t] < Bxp{-n-(p(t/n)), 
P[X <~E[X]-t] < exp(-fji-(p(-t/n)). 

In particular, for any t > 1 we have P [X > i/i] < exp [— tfx ]xx(t/e)] . 



Balls and bins. Consider a balls and bins experiment where [i balls are thrown independently and uni- 
formly at random into v bins. Thus, the probability of each distribution of balls into bins equals . We 
will need the following well-known "Poissonization lemma" (e.g., Ifl6l Section 2.6]). 

Lemma 2.3 In the above experiment let e; be the number of balls in bin i £ [v]. Moreover, let A > 
and let (&i)ieM be a family of independent Poisson variables, each with mean A. Then for any sequence 
(U)i£[is] of non-negative integers such that $3 i=1 U = fJ-we have 



P |Vt € [v] : e, = U] = P 



Vi G [u] : h = U 



Hence, the joint distribution of(ei) i£ ^ coincides with the joint distribution o/(&i)ie[^] giv en 5Zi=i ^ = I 1 - 



We are typically going to use Lemma 1231 to obtain an upper bound on the probability on the left hand 
side. Therefore, the following simple corollary will come in handy. 

Corollary 2.4 With the notation ofLemma \2.3\ assume that A = fx/u > 0. Then for any sequence (ti) i& ^ 
of non-negative integers such that Xa=i U = l^we have 



P [Vi G [v\ : ei = U] < 0(y/JT) • P [Vi G [v\ : b, = U] 



4 



Proof. Let b = Y2i=i ^» = l l - Since the bi are independent Poisson variables with means A = \xjv, 
b is Poisson with mean /i. By Stirling's formula, P [6 = fi] = ^ exp(— = f2(^~ 1//2 ). Hence, 
Lemma 12.31 yields 

P^i€[ V ]:e i = U] = P ^ p f^J ^ = U] = 0(Vfi • P |V» € M : 6« = t«] , 
as claimed. □ 



3 Covers 

Let G = (V, E) be a graph, let fc > fco be an integer, and let a : V — > [fc] be a fc-coloring of G. 
We would like to identify a set F C V of vertices whose colors cannot be changed easily by a "local" 
recoloring of a few vertices. For instance, if v is a vertex that does not have a neighbor of color j for some 
j G [k] \ {a(v)}, then v can be recolored easily. More generally, we would like to say that, recursively, 
a vertex can be recolored easily if there is a color j such that all its neighbors of color j can be easily 
recolored. To formalize this, we need the following concept. 

Definition 3.1 Let £ : V — > {0, 1, . . . , k}. We call v E V stable under ( if C,(v) ^ and if for any color 
j G [fc] \ {C( w )} there are at least two neighbors u\, of v such that C( u i) = C( M 2) = j- 

Now, consider the following whitening process that, given a fc-coloring a of G, returns a map a : V —> 
{0, 1, ... , fc}; the idea is that a(v) = for all v that are easy to recolor. 

WH1. Initially, let a(v) = a(v) for all v G V. 

WH2. While there exist a vertex v E V with a(v) ^ that is not stable under a, set a(v) = 0. 

The process WH1-WH2 is similar to processes studied in [36. 37) in the context of random graph coloring, 
and in j7| in the context of random fc-SAT. (The term "whitening process" stems from 11361 .) Clearly, the 
final outcome a of the whitening process is independent of the order in which WH2 proceeds. 

The intuition behind the whitening process is that if we attempt to recolor some stable vertex v with 
another color j G [fc] \ {&(v)}, then we will have to recolor two additional stable vertices m, Ui as well. 
Hence, any attempt to recolor a stable vertex is liable to trigger an avalanche of further recolorings (unless 
the graph G has an abundance of short cycles, which is well-known not to be the case in the random graph 
G(n, m) w.h.p.). 

The following definition is going to lead to a neat description of the outcome of the whitening process. 
Definition 3.2 A k-cover in G is a map Q : V — > {0, 1, . . . , fc} with the following properties. 
CV1. There is no edge e = {u, v} such that C{ u ) = C{ v ) 7^ 0. 
CV2. If C( v ) 0, then v is stable under 

CV3. If C(i>) = 0, then there are i,j G [fc], i =/= j, such that v does not have a neighbor u with C,{u) = i 
and v has at most one neighbor w with C{ w ) = j- 

The concept of covers is very closely related and, in fact, inspired by the properties of certain fixed points 
of the Survey Propagation message passing procedure I13T1 . (To my knowledge, the term "cover" has not 
been used previously in the context of fc-colorability, although it appears to be in common use in the context 
of satisfiability ||29l .) 

Now, the outcome of a is the cover characterized by the following two properties. 

i. For all vertices v such that &(v) ^ Owe have a(v) = cr{v). 

ii. Subject to i., |<r _1 (0)| is minimum. 
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Of course, in general the graph G may have many fc-covers that cannot be obtained from a fc-coloring via 
the whitening process. This motivates 

Definition 3.3 A k-cover £ of G is valid if G has a k-coloring a such that £ = a. 

To prove Theorem ll.il we perform a first moment argument for the number of valid fc-covers. The 
main task is to show that the all-0 cover (i.e., £(i>) = for all vertices v) is not a valid A; -cover in G(n, m) 
w.h.p. To this end, we need to establish a few basic properties that all fc-colorings of G(n, m) have w.h.p. 
More precisely, in Section |4] we are going to prove the following via a "standard" first moment argument 
over fc-colorings. 

Proposition 3.4 Assume that k > ko for a sufficiently large constant ko. Moreover, assume that d = 
2k In k — In k - c, with < c < 4. 

1. Let Z be the number of k-colorings of G(n, to). Then InE [Z] = c+ °^ 1 ^ . 

2. W.h.p. all k-colorings of G(n,m) satisfy |er -1 (i)| = (1 + Ofc(l))r for all i £ [k], 

3. In fact, w.h.p. G(n,m) does not have a k-coloring a such that \o~~ 1 {i) — n/k\ > nj (k In 4 k) for 
more than In 8 k colors i £ [k]. 

Building upon Proposition [33] we will establish the following properties of valid fc-covers in Section[5] 

Proposition 3.5 There is a number ko such that for fc > fco and 2k In fc — In fc — 4 < d < 2k In fc any valid 
k-cover £ ofG(n, m) has the following properties w.h.p. 

1. We have |C _1 (0)| < nk~ 2 / z . 

2. For all i £ [fc] we have |C~ 1 (*)| = (1 + Ofc(l))n/fc. 

3. In fact, there are no more than In 9 fc indices i £ [fc] such that |C~ 1 (*) — n/k\ > n/(fcln 3 fc). 
Finally, in Section|6]we perform the first moment argument over fc-covers. 

Proposition 3.6 There is = 0fc(l) such that for d > 2fclnfc — lnfc — 1 + Sk w.h.p. the random graph 
G{n, m) does not have a k-cover with properties 1— 3. from Proposition ^ .5\ 

Theorem 1 1 . 1 1 is immediate from Propositions I3.5l and 13.61 Furthermore, we will prove Corollary II. 2 1 in 
Section[5] 



4 Proof of Proposition 1X4 



The proof of Proposition 13 ,4l is very much based on standard arguments, reminiscent but unfortunately not 
(quite) identical to estimates from, e.g., 0. Suppose d = 2fclnfc — lnfc — c with < c < 4. Throughout 
this section we work with the random graph G'(n, m) with m independent edges. 



Lemma 4.1 Let v = {i>\, . . . , v^) be a k-tuple of non-negative integers such that Y) . 
the number of k-colorings a of G'(n,m) such that cr _1 («) = fiforalli£ [fc]. Then 



n. Let Z v be 



InE [Z v \ = o{n) + ^ v % ln(n/^) + - In 



'-EG 



(6) 
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Proof. Let £„ be the set of all a : V — > [k] such that ler 1 (£) I = i/j for all i € [fc]. By Stirling's formula, 



ln|£„| = o(n) + z/, ]n(n/v>i) 



(7) 



i=i 



Furthermore, the probability of being a fc-coloring in G'(n, m) is the same for all a 6 £„. In fact, due to 
the independence of the edges in G'(n, m), this probability is q = (1 — X^iC^A 1 ) 2 )" 1 ' because a is a 
fc-coloring iff each of the color classes cr~ 1 (i) is an independent set. As E [Z„] = \T, U \ ■ q, the assertion 
follows from (jTJ. □ 

Corollary 4.2 Lef Z be the total number of k-colorings of G'(n, m). We have 

- InE [Z] = lnfc + ^ ln(l - Ilk) + oil) = + O fc (lnfc/fc 2 ). 
n 2 2fc 

Proof. Let A/" be the set of all fc-tuples v = (yi, . . . , v k ) of non-negative integers such that J^i=i "i = n - 
Then E [Z] = E \ Z A ^ nfe max ^e.V E [ Z A ■ Hence, LemmaHTTIvields 



1 { k d \ k 

-lnE[Z] = o(l) + maxi V%.;ln(?V^)+ ~ ln 1 - ^ (~ 



Letting A be the set of all fc-tuples a = ia.\ , . . . , a k ) G [0, l] h such that X)i=i a i — 1> we obtain from dHJ 



i=l 

fc „ 1 * v^fe 



SAT 



(8) 



1 { k d 

- InE [Z] = oil) + max < - on hiion) + - In 

77 ^ — ^ 9 



(9) 



The entropy function — X)i=i a i ha(cKi) i s well-known to attain its maximum at the point a = ^1 with all 
fc entries equal to 1/fc. Furthermore, the sum of squares Yl%=i a 1 at; tains its minimum at a = -^1 as well. 
Hence, the term | ln[l — 2»=i a ?]> anc l mus ®' I s maximized at il. Consequently, 



-lnE[Z] = lnfc + -ln(l - 1/fc) + o(l) = lnfc - - 
n 2 2 



In fc 



fc In fc 



lnfc 



-+0(lnfc/fc 2 ), 



as claimed. 



□ 



Corollary 4.3 W.h.p. all k-colorings a of G' (n,m) satisfy \a = (1 + o k (l))^ for all i £ [fc], and 

there is no k-coloring a such that \a~ l {i) — l/k\ > l/(fcln 4 fc) for more than In 8 fc colors i £ [fc]. 

Proof. Let v = [y\, . . . , v k ) be a fc-tuple of non-negative integers such that J2i=i v i = n - We are going 
to estimate E[ZJ in terms of how much v deviates from the "flat" vector (n/k, . . . , n/k). To this end, we 
compute the first two differentials of ©. Set a = (ai, . . . , ot k ) = v/n and let f(a) = — 5TJ i=1 a ; hict; + 

f m (l — Si=i a f )■ Since X^=i a i = 1> we can eliminate the variable a k = I — 2»=i a «- Hence, we 
obtain for i, j G [fc — 1], i ^ j 



df_ 

da.i 

d 2 f 



]n(a k /oii) + 



1 1 



d(a k - aj) 

1 - Il«ll2 

2d 



2d(a k - ctiY 



a k ai l-\\a\\ 2 2 (l-||a|| 2 ) 2 ' 
1 d 2d(a k — cei)(a k — aij) 



a k l-\\a\\ 



(1-Il«ll2) 2 



(10) 

(11) 
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In particular, the first differential Df vanishes at a = ^1. At this point, the Hessian D 2 f = { Q^.g a . )ije[k—i] 
is negative-definite, whence a = il is a local maximum. Because the rank-one matrix ((o^ — a.;) • (ctk — 
a j))i,je[k-i] i s positive semidefinite for all a, dTOb and ( fTTT > show that D 2 f is negative-definite for all a. 
In fact, due to the — , 2d ..■> term in dTOb . all its eigenvalues are smaller than — - — ^^7*^ < —d. Therefore, 
Taylor's theorem yields that 

/(«) < - § ||« - * _1 l|la 

for all a. Hence, Corollary 14. 2l implies that 

^mE[Z„] < ^ + O fc (liifc/fc 2 )-^||a-fc- 1 l||J. (12) 

Since d = (2 — Ofe(l))fc In k, the right hand side of (TTZb is negative if either 

• max !£ [ t ] |a, — fc _1 | > (A; In 1 ' 3 A) -1 , or 

• there are more than In 8 k indices i G [k] such that \a- t — l/k\ > (k In 4 fc) _1 . 

Thus, Markov's inequality shows that w.h.p. there is no /c-coloring with either of these properties. □ 
Finally, Proposition l3.4l is immediate from Lemma |2~T1 and Corollaries l4.2l and l4.3l 



5 Proof of Proposition 133 



Suppose d = 2k In k — In k — c with < c < 4. Throughout this section we work with the random graph 
G'(n, m) with m independent edges. 



5.1 The core 

Let a : V -> [k] be a map such that |cr _1 (i)| = (1 + Ofc(l))| for all i € [k]. Moreover, let G'(cr) be 
the random multi-graph G'(n, m) conditional on a being a valid /c-coloring. Thus, G'(cr) consists of m 
independent random edges ei = (tti, ui), . . . , e m = (u m , « m ) such that cr(iij) ^ cr(uj) for all i £ [m]. 
To prove Proposition 13.51 we need to show that with a very high probability, a large number of vertices 
of G'(cr) will remain "unscathed" by the whitening process WH1-WH2. To exhibit such vertices, we 
consider the following construction. Let £ = cxp(— 7) Ink and assume that k > ko is large enough so that 
I > 3. Let Vi = a- x {i) for i G [k]. 

CR1 For i G [fc] let W 4 = G F» : 3j £ i : e(v, Vj) < 31} and W = IjLi w i- 
CR2 Let U = {v eV :3j : e{v, Wj) > £}. 

CR3 Set Y = U. While there is a vertex v E V\Y that has I or more neighbors in Y , add w to Y . 

We call the graph G'(tr) — W — Y obtained by removing the vertices in U Y the core of G'(tr). By 
construction, every vertex t> in the core has at least I neighbors of each color j ^ <r(v) that also belong to 
the core. In effect, if a is the outcome of the whitening process applied to G'(a), then a(v) = <r(v) for all 
vertices v in the core. 

The construction CR1-CR3 has been considered previously to show that a random fc-coloring of the 
random graph G(n, m) has many frozen vertices w.h.p. fl2] [T2) . In the present context we need to perform 
a rather more thorough analysis of the process CR1-CR3 to show that w.h.p. all fc-colorings a of G(n, m) 
induce a non-zero cover a. To obtain such a strong result, we need to control the large deviations of various 
quantities, particularly the sizes of the sets W, Wi and U. More precisely, in Section IBT21 we prove 

Lemma 5.1 With probability at least 1 — exp(— 16n/fc) the random graph G'(er) has the following prop- 
erties. 
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1. We have \ W\ < nk 



-0.7 



2. For all i £ [fcl we have \ WA < 



n In In k 



klrvk 

3. There are no more than In 4 k indices i € [k] such that \ Wi\ > r A fc . 

Moreover, in Section IB31 we are going to establish 

Lemma 5.2 In G'(a) we have P [\U\ > ,J j^] < exp(-10n/fc). 

To estimate the size of Y we use the following observation. 

Lemma 5.3 W.h.p. the random graph G'(n, m) has the following property. 

For any set y C V of size \y\ < [^xf^l we have e(y) < (13) 

Proof. For any fixed set y of size < yn < [ fcinfc 1 tne number e(J^) of edges spanned by y in 
G'(n, to) is binomially distributed with mean 



2 



E [e(y)\ = V^L = (i + 0fc (i))j / 2 dn/2 < 2y 2 nfclnfc 
Hence, by the Chernoff bound 

P [e(y) > ytn/2] < exp 



yin ( yinll 
In ' 



< 



exp 



(14) 



< exp [j/n(l — lny)] < cxp(— 2j/nlnj/). (16) 



2 Ve-E[e(^)]. 
Since ky < 3 In In fc/ In fc and £ = tt k (In fc), (|T4]> yields 

P [e(y) > yin/2] < exp [3yn ln(y)] . (15) 
Further, by Stirling's formula the total number of sets y C V of size yn is 

» 

Combining (TT~5T > and ( fTol l with the union bound, we obtain 

P [3y C V : M = yn, e(y) > £ \y\ /2] < exp [ynMy)} . 

Taking the union bound over all possible sizes yn completes the proof. □ 

Proof of Proportion \3.5\ By Proposition ^. 41 w.h.p. all fc-colorings a of the random graph G'(n, to) satisfy 
| cr 1 (i) | = (1 + Ofc(l))n/fc for all i e [k]. Let us call such a fc-coloring a of G = G'(n, m) good if it has 
the following two properties (and bad otherwise): 

Gl. Step CR1 applied to G, a yields sets W\, . . . , Wk, W that satisfy the three properties in Lemma IBTTI 
G2. The set U created in step CR2 has size \U\ > "^ fc . 

Let Zbad be the number of bad fc-colorings of G'(n, m). Since G'(cr) is just the random graph G'(n, to) 
conditional on a being a fc-coloring, we have 

E [Zbad] = ^2 P [°" is a fc-coloring of G(n, m)] ■ P [a is bad in G'(n, to) \a is a fc-coloring] 
= P [cr is a fc-coloring of G(n, m)] ■ P [cr is bad in G'(cr)] 
< 2exp(-10n/fc) • J^P [cr is a fc-coloring of G(n, to)] [by Lemmas l5.1l and 



< 2 exp(— 10n/fc) • exp(cn/fc + o(n)) [by Proposition s? 

< exp(-6n/fc + o(n)) = o(l). [as c < 4] 
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Hence, w.h.p. the random graph G'(n, m) does not have a bad fc-coloring. 

Now, consider a good fc-coloring a. By Lemma [531 we may assume that ( [T3l holds. To bound the size 
of the set Y created by step CR3, observe that each vertex that is added to Y contributes I extra edges to 
the subgraph spanned by Y . Thus, assume that \Y\ > ^rpr k an d consider the first time step CR3 has 
got a set Y' of size 

r 2 fc'infc fc l ■ Then Y ' s P ans at least y £n / 2 ed 

ges, in contradiction to H3i . Hence, w.h.p. 

G'(n, m) is such that 

for any good fc-coloring the set Y constructed by CR3 has size at most 2v j} ° p . (17) 

If ( TP71 ) is true and G'(n, m) does not have a bad fc-coloring, then for any fc-coloring a the set W U Y 
constructed by CR1-CR3 has size at most \ W U Y | < fc~°- 7 n + 2n k 1 ^ k < nk- 2 / 3 (the bound on | W\ 
follows from Gl). This shows the first property asserted in Proposition 13. 51 because the construction CR1- 
CR3 ensures that the cover a obtained from a via the whitening process WH1-WH2 satisfies a(v) = a{v) 
for all v € V \ (W U Y). By the same token, the second assertion follows because by Gl and $1% for 
every color i e [k] we have 

| ( x- 1 (i)n {WUY)\ < \Wi\ + \Y\=n-o k (l/k). 

Finally, the Gl and ( TTTb also imply that there cannot be more than In 9 fc indices i G [fc] such that (i) — 
n/k\ > n/(k In 3 fc), which is the third assertion. □ 

Proof of Corollary 17.21 We claim that the vertices in F = V\ (W U Y) are ^-frozen w.h.p. for 5 = 
l/(fclnfc). Indeed, assume that r is another fc-coloring such that the set A = {v € F : t(v) ^ &(v)} has 
size < |A| < Sn. Every vertex v 6 A has at least I neighbors in A. Indeed, the construction CR1- 
CR3 ensures that every vertex v € A has at least I neighbors colored t(v) ^ cr(v) in a. Because r is a 
fc-coloring, all of these neighbors must belong to A as well. Hence, A violates ( fT3b . Thus, the assertion 
follows from Lemma I5~3l □ 



5.2 Proof of Lemma 15.11 



We begin by estimating the number of edges between different color classes. Recall that V{ = a 1 (i) for 
i G [fc], and that we are assuming that \Vi\ — (1 + Ofc(l))n/fc. Let vi = \Vi\ for i = 1, . . . , fc. 



Lemma 5.4 In G'(a) we have 
P 
P 



, T , TrN 0.99dn 
mm e(Vi,Vj) < —r^— 
i<i<j<k J fc 2 



,„ Tr , l.Oldn 
max e{Vi,Vj) > — — 
i<i<j<k J/ fc 2 



< exp(— lln/fc) and 

< cxp( — lln/fc). 



Proof. Because the edges e±, . . . , e m are chosen independently, for any pair 1 < i < j < fc the random 
variable e(Vi, Vj) has a binomial distribution Bin(m, g.y ), where 



9ij 



> 



Since we are assuming that Vj = (1 + 0^(1))?, we have qij > (2 + Ofc(l))/fc 2 . Thus, E [e(Vi, Vj] 
mqij > (1 + Ok{l))dn/k 2 . Hence, the Chernoff bound yields 



0.99dn 
2fc 2 



< exp 



• 10 4 fc 2 



< exp(-12n/fc). 



Finally, the first assertion follows by taking a union bound over The second assertion follows analo- 
gously. □ 
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Proof of Lemma \5A\ By Lemma l5~4l we may disregard the case that mini<i<j-<fc e(Vi, Vj) < °' 9 ®f n . Thus, 
fix integers (?7iy-)i<i<^<fe such that 



> 



0.99dn 



and 



m * 

l<i<j<k 



(18) 



Let Ai be the event that e(Vi, Vj) = rriij for all 1 < i < j < k. 

We need to get a handle on the random variables (e(v, Vj)) v ^Vi (i-^-, the number of neighbors of v in 
Vj) in the random graph G'(a). Given that Ai occurs we know that J2 v ev- e ( v ' ^j) = e (^j ^i) = m ij- 
Furthermore, because G'(a) consists of m independent random edges e%, . . . , e m , given the event Ai 
the rriij edges between Vi and Vj are chosen uniformly and independently. Therefore, we can think of 
the vertices in Vi as "bins" and of the rriij edges as randomly tossed "balls". In particular, the average 
number of balls that each bin v G Vi receives is rriij /vi- Crucially, these balls-and-bins experiments are 
independent for all 

To analyze them, we are going to use Corollary 12.41 Thus, consider a family {b V j) v ev,je[k]\{a(v)} 
of mutually independent Poisson variables with means E [b V j] = m a ^j /v a ( v y Then for any family 

(tvj)vev,j£[k]\{<r(.v)} of integers we have 



P [V«, j : e(v, Vj) = t vj \M] < exp(o(n)) • P [W, j : b 



'•j 



>>ji 



(19) 



In words, the joint probability that the random variables (e(i>, Vj)) v& v,je[k]\{a(v)} take certain values given 
that Ai occurs is dominated by the corresponding event for the random variables (b V j). 

If \Wi\ > n l n ^ n k k , then there are at least N = n k l 1^ k k vertices v e Vi such that mirijgrfcnm e(v, Vj) < 
3£. Thus, let Wi be the number of vertices v £ Vi such that mirijg^]^} b V j < 3£. Then (fl9l yields 



P [\Wi\ > N\M] < exp(o(n)) ■ P [Wi > N] 



(20) 



Furthermore, because the random variables {b v j)ve.Vi,je[k]\{i} are mutually independent, Wi is a binomial 
random variable with mean E [Wi] < Viqu where qi = J2je[k]\{i} P [P°( m ij l v i) < 3^]. Since v L = 
(1 + o k (l))n/k and rriij > 0.99dn/k 2 , we have Hij/vi > 0.98d/fc > 1.95 In k. Recalling that I = 
exp(-7) lnfc, we find P [Po^-/^) < 3t] < fc" 19 and thus q t < (k - l)k~ 19 . Hence, 



E[Wi] < (l + o fc (l))A 
Therefore, the Chernoff bound gives 

> k 1.8 N 



- 19 n < k~ 18 r 



P [Wi >N}< exp 
Combining ( f2Qb and d22l . we obtain 



-N In 



< exp(-20n//c). 



P [\Wi\ > N\M] < exp(o(n) - 20n/k) < exp(-19n/fc). 



(21) 



(22) 



(23) 



Now, consider the event that there are at least n = [In 4 fc] classes i\, . . . , i K such that \ W%\ > N' 



k In 4 k 



. We have 



-JV'ln 



P [W i} > N'] < exp 
Furthermore, because the random variables , . . 

P[\{i€[k}:[Wi\>N'}[>K\M} < 



k i.s N > 



< 



exp 



—N' Ink 
2 



(24) 



yVi K are independent, we obtain from ([19) and (l24l 



exp 



--AT' lnfc 
2 



< exp(-20n/fc). 



(25) 



With respect to the event \ W\ > nk 7 , observe that by (|2TT > the sum W = Y^i=i W« I s stochastically 
dominated by a binomial random variable with mean nfc -0 ' 8 . Therefore, by ([19} and the Chernoff bound 

P [\W\ > nk-°- 7 \M] < P [W > nk- - 7 } < cxp(-n£T a7 ) < exp(-20n/fc). (26) 

Finally, since the estimates ( l23l , ( f25l l, d26l l hold for all Ai, the assertion follows from Bayes' formula. □ 
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5.3 Proof of Lemma 15.21 

We begin by estimating the number of edges between the sets Wi and the color class Vj . As before, we let 
that Vi = cr _1 (i) fori G [k] and v l = |V*| = (1 + o k (l))n/k fori = 1, ... , k. 



Lemma 5.5 In G'(cr) we have 
P 



max e(Wi,Vj) > 

l<i<j<k 



2n In In k 



< exp(— lln/k). 



Proof. Fix 1 < i < j < k. We begin by proving the following statement. 

For any set S C V of size |5| < "^ fc we have P [e^, V,) > 2ralnlnfc/fc] < exp(-13n/fc). (27) 
Indeed, for any set S as above the number e(S, Vj) of edges that join 5 to V, has a binomial distribution 



Bin(m, (y,,s), where 



2 Vj \S\ 



< 



3 In In k 
k 2 lnk ' 



n2 ~ Ei=i ^ 2 

the last inequality follows from our assumption that i/j = (1 + Ofc(l))n/fc for all Z G [fc]. Hence, 



E[e(5,^)] 



3d In In k 3 In In k 

m Qj,s < , . „ . — — • 7i < — — — ■ n 



2k' 2 Ink 



2k 



Thus, ( l27l i follows from the Chernoff bound. Taking the union bound over all possible sets S of size 
\S\ < " 1 " ll l fc , we obtain from <|27j 



. _. nlnlnA; , ^, TrN 2nlnlnfc 
35 c Vu \S\ < : e{S,V-) > y— 



k In k 



< 2^ exp(-13n/jfe) < exp(-12n/fc). (28) 



AsP > 



n In In 1 
kink J 



< exp(— 16n/fc) by Lemma I5T1 the assertion follows from 



□ 



Lemma 5.6 Let Ti be the number of vertices v G Vi such that max^j e(v,Vj) > 100 In A; and let T 
Sie[fe] Ti- Then in G'(a) we have 



T > 



4fclnfc 



< exp(— 10n/fc). 



Proof. For an integer vector m = (my)i<j<j-<fc let f^, be the event that e(V^, Pj) = my for all 1 < 
i < j < k. Set rriji = my for 1 < i < j < k. By Lemma 15741 we may confine ourselves to the case 
that e(Vi, Vj) < ^ for all i ^ j. Thus, fix any m such that my < ^ for all i < j. Given £ m , for 
each of the my edges between color classes Vi, Vj the actual vertex in Vi that the edge is incident with is 
uniformly distributed. Thus, we can think of the vertices v G Vi as bins and of edge my edges as balls of 
color j, and our goal is to figure out the probability that bin v contains more than 100 In A: balls colored j 
for some j ^ i. Because we are conditioning on £ m , these balls-and-bins experiments are independent for 
all color pairs i ^ j. 



To study these balls-and-bins experiments we use Corollary 12.41 Thus, let (6 



vj)v£V,j&[k]\{a(v)} 



be a 



family of mutually independent Poisson variables such that E[6„j] = my/fj for all v G Vi, j G [k] \ {i}. 
In addition, let % be the number of vertices v G Vi such that max^ b V j > 100 In k and let T — Yli=i T%. 
Then Corollary I2.4| gives 



T > 



4k In k 



f 



< exp(o(n)) • P 



T > 



4k In k 



(29) 



To complete the proof we need to bound P [T > 4fc " n k ] . For each vertex v G Vi and each j ^ i we 
have E [b V j] = rriij/ui < W^-. < 31nfc. Hence, by Stirling's formula 

P [b vj > 100 In k] < E W l sl - k 9 °- 

s>100 Ink 
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Because the random variables b V j are mutually independent, T is a sum of independent Bernoulli random 
variables. Applying the union bound, we thus have 



max b V j > 100 lnfc 



< k 



,-89 



for any v € V. 



(30) 



Therefore, (l30T > shows that T is stochastically dominated by a binomial random variable Bin(n, fc 89 ). 
Consequently, the Chernoff bound yields 



T > 



< P 



Bin(n,fc -89 ) > 



4fc In A; 

Finally, combining d29b and OTT ) yields the assertion 



4fclnfc 



< exp(-20n/fc). 



(31) 

□ 



Proof of Lemma \5?2\ Let d = (duj)uev;je[fc]\{o-(ii)} be an integer vector. Moreover, let Ed be the event that 
e(v, Vj) = d V j for all v E V, j ^ &(v). We are going to estimate the size of U given that Ed occurs for a 
vector d that is "compatible" with the properties established in Lemmas l5.4145.6l More precisely, we call 
d feasible if the following conditions are satisfied. 



i. For all i ^ j we have to. 



12veVi dvj > Moreover, m, 



TO," 



ii. For all i ^ j we have Wij = ^2 vi 



< 



2n In In k 



veVi:d vj <3e — k 

iii. Let T be the set of all vertices v such that max^^) d v j > 100 In k. Then \T\ < 4fc " nfc . 
By Lemmas 15.4145. 61 we just need to show that for any feasible d we have 



, , n In In k , „ 



< exp(-10n/fc). 



(32) 



Given the event Ed, the total number my of Vj-Vj ^ -edges is fixed. So is the number Wji of Wj-Vi edges. 
What remains random is how these edges are matched to the vertices in Vi, More specifically, think of 
the Wj-Vi -edges as black balls, of the Vj \ Wj-Vi -edges are white balls, and of the vertices v € Vi as 
bins. Each bin v has a capacity d V j. Now, the balls are tossed randomly into the bins, and our objective 
is to figure out the number of bins that receive more than I black balls. Observe that these numbers are 
independent for all pairs i ^ j of colors. 

To estimate this probability, consider a family (b V j) v ev.je[k]\{cr(v)} of independent binomial random 
variables such that b V j has distribution Bin(d^j, Wji/rriji). Let B be the event that J^vev- ^vj = Wji for 
all i j. Furthermore, let U be the number of vertices v such that max^^) b V j > I. Then 

p ^ n In In k ] 



, , n In In k , „ 



nlnlnfc , 



< 



k In k 



P[B] 



(33) 



The sums J^vev- are binomial random variables Bin(TOy, Wji/rriji). Moreover, they are indepen- 



dent for all i ^ j. Therefore, Stirling's formula yields 

mij,Wij/mij) = rriij] 



P [B] = JJ P [Bin( 

i¥=3 



9( n -*(*-i)/2) = exp(o(n)). 



(34) 



Let u € Vi be a vertex such that for color j ^ i we have d W j < 100 lnfc. Then our assumptions i. and 



ii. on d ensure that E [b. 



— < 300 In In k. Therefore, by the Chernoff bound 



P [b vj >£}< exp 
Hence, taking the union bound, we find 



-Mn 



■E\b 



"31 



< k~ 



max b v j > £ 



< k~" if max d vj < 100 lnfc. 



(35) 
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Let U' be the number of vertices v such that maxj^ a / v ) b V j > I while max,-^!,,) d V j < 100 In fc. Because 
the random variables b v j are independent, d35l l implies that W is stochastically dominated by a binomial 
random variable Bin(n, fc -99 ). Therefore, the Chernoff bound gives 



n In In fc 
2k In k 



< P 



Bin(n, fc Jy ) > 



; In In k 



2fclnfc 



< exp(— lln/k). 



(36) 



As U < T + W < W + jj^j: by our assumption iii. on d, (O implies that P [U > n ^° fe ] < 
exp(— lln/k). Thus, the assertion follows from ((33) and ((35). □ 



6 Proof of Proposition 1X6 



Throughout this section, we let £ : V — > {0, 1, . . . , k}, Vi = £ 1 (i) and V{ = \ Vi \ for i = 0,1, ... ,k. In 
addition, we let a* = i^/ n. We always assume that the conditions of Proposition l3.6l hold, namely 

Zl. |C _1 (0)| < nk- 2 ^. 

Z2. |C _1 (i)l = (l + o fc (l))n/fcforalHe [fc]. 

Z3. There are no more than In 9 k indices i g [fc] such that |C~ 1 (*) — n/k\ > n/(fc In 3 fc). 
In addition, we assume that d = 2k In fc — In fc — c for some < c < 4. 



6.1 Counting covers 



To prove Proposition [3T6] we perform a first moment argument over the number of covers (. Let Iq be the 
event that V\, . . . , Vk are independent sets in G'(n, m). Moreover, let Q be the event that £ is a fc-cover 
in G'(n,m). Clearly, Cq C 1q, and we begin begin by estimating the probability the latter event. Let 

^ = £^4 

Lemma 6.1 We have i In P [l c ] = | ln(l - F c ). 

Proof. For each of the edges the probability of joining two vertices in Vj is (vj/n) 2 — a 2 . Hence, the 
probability that e.; does not fall inside any of the classes Vi, . . . , \% is equal to 1 — F{. Thus, the assertion 
follows from the independence of ex, ... , e m . □ 

In Section l6T2l we are going to establish the following estimate of the probability of C a . 



Lemma 6.2 We have i InP [C^ |2^] < J2i=o a i m P« + °(1)> where 



Pa 



P, 



E 

i,je[k]:ijtj 

n i 



ctjd 
1-Fc 



ctjd 
I -Ft 



cxp 



cxp 



l-f> 



l-f> 



/or i = 1 , . 



Proof of Proposition 13.61 Let A be the set of all vectors a = (ao, . . . , a^) € [0, l] fc+1 that satisfy the 
following three conditions (cf. Z1-Z3): 

L Ei=o a* = !. 

ii. We have ao < fc -2 / 3 and a» = (1 + Ofe(l))/fc for z = 1, . . . , fc. Indeed, there are no more than 
K = In 9 fc indices i e [fc] such that |<x t - l/fc| > fc" 1 In" 3 fc. 



iii. o^n is an integer for i = 0,1, 
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For a € A let S a be the set of all maps Q : V — > {0, 1, . . . , fc} such that \( = can for all i. Then 



S n — 



( n \ = ( n y ( (! - a o) n ^ < ( n y fe (i-«o)n_ 

\a n, . . . , a fc n/ V a o«/ \ain, . . . , a k nj \a nj 



Hence, Stirling's formula yields 

— lnS a < — ao In ao — (1 — ao) ln((l — cto) Ik). (37) 
n 

Lemmas |6 . 1 1 and lo*72l show that for any £ G 5a, 

-| i fe 

-lnP[C c ] < o(l) + -ln(l-F c )+^a i lnp i . 

«=o 

Given the value of ao, the sum = Xa=i a f i s minimized if a, = (1 — ao)/k for all i £ [fc]. Thus, 

-IT 

-lnP[C c ] < o(l) + -\n(l-(l-a ) 2 /k) + Y / a l \n Pt . (38) 

Using the approximation ln(l — z) = —z — z 2 /2 + 0(z 3 ) and recalling that d = 2fclnfc — lnfc — c, we 
see that 

^ln(l-(l-a ) 2 A) = -(l-a ) 2 hifc 

(l-a ) 2 lnfc + c(l-a ) 2 _ (1 -a ) 4 lnfc + Q , k -iA\ 
2 /c 2/c 2 /c 

= -(l-2ao)]nfc + ^+Ofc(l/fc) [as a < fc- 2 / 3 by ii.]. (39) 

Furthermore, because G (0, 1) and as ln(l — z) < — z for all z € (0, 1), we get 

k 

Qjjlnpi < — + Q?j(i) exp(— ajd) 

t=l i,j'G[fc]:i/j 

= — (1 — «o — + Qijd) cxp(— ayd). (40) 

je[k] 

Since ay = (1 + o k (l))/k for all j e [fc] by ii. and as d = 2fclnfc — Ofc(lnfc), (gOjl yields 

k 

^(Xilnpi < O k {k- 1 - 9 )-(l-a )^2{l + a J d)exp(-2a J khik). (41) 

i=i 3 -e[fc] 

Moreover, applying condition ii., we obtain from (|4H 

k 

'^2a l \np l < O k (k- 19 ) +0(K Ink) • cxp(-2(l + o fe (l)) In fc) 

1=1 -(1 - ao)(A - K)(l + 2 lnfc + O fc (l/ln 2 fc)) exp(-2(l + O fc (ln- 3 fc)) In fc) 

1 -I- 9 In h 

< 0fc (l/fe) - (1 - a ) • + ^ [asF<fc 001 ] 

< o fc (l/fc)- 1 + ^ lnfc [asa <fc- 2 / 3 byii.] (42) 
Further, again because € (0, 1) we have 



Po - \ X! (1 + 2a J d)cxp[-(a l + aj)(i] 



2 



< O k {k-' i+ ° k{1) K) + fc( ~ fc 2 - [l + 41nfc + O fe (ln" 2 fc)]exp[-41nfc + 0(ln" 2 fc)] [by condition ii.] 



1 +41nfc + O fe (ln~ 1 fc) 
" 2fc2 ■ 
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Hence, 



aolnpo < aoln 



1 + 4 In fc 
2fc 2 



Plugging ([39]l, (US and (03]) into ([38}, we obtain 

1 , t m i c . , , , 1 + 2 In k 

-lnPCc < — --(l-2a )lnfc ; +a ln 

n 2k k 



ao • Ofe(l). 



1 + 41nfc 



(43) 



+ ao-Ofc(l) + Ofc(l/fc) 



-2-41nfc 
2fc 



— In fc + ao In 



1 + 41nfc 



2fc 2 

a • Ofc(l) + Ofe(l/fc) 



(44) 



Finally, combining ( l37l i and ( l44l ). we get 

1 , „ „ , ix c - 2 - 41nfc 

n 2fc 



2fcao 



1 + 4 In fc 

(1 - a )ln(l - «o) + ■ Ofe(l) + o k (l/k) 

2kao 



c - 2 - 4 In fc 
" 2fc +a ° 



1 - In 



1 + 4 In fc 



o fc (l) 



+ o fe (l/fc). (45) 



Elementary calculus shows that the function ao G (0,1) M> — ao(l — m TT^^fc ° fc W) attams i ts 

c-l + o fc (l) 



maximum at a = (1 + Ofc(l)) 1+ ^ nfc . Hence, (J45J yields 



-ln(|5 Q |-P[C c ]) < 



2fc 



(46) 



To complete the proof, consider for any a E A the number E a of fc-covers £ of G'(n, m) such that 
|C -1 (*)| = a % f° r an *• Then (06]) implies that i lnE[E a ] < ^ - o fc (l) for all a € A. Hence, there is 
< £k = Ofe(l) such that for c < 1 — £k we have 



E[£„] < exp 



c- 1 
2fc 



-o fc (l) 



< exp(— £fe/2) = exp(— fi(n)). 



(47) 



Since condition iii. ensures that \A\ < n h = exp(o(n)), the assertion follows from (|47| | by taking the union 



bound over all a. € A and applying Lemma I2TT 



□ 



6.2 Proof of Lemma 16.21 

Givenl^, the pairs ei, . . . , e TO that constitute the random graph G'(n, m) are simply distributed uniformly 
and independently over the set of all n 2 (l — F^) possible pairs that do not join two vertices in the same 
class Vi for i = 1, . . . , k. For each vertex v and each j € {0, 1, . . . , fc} let d v j be the number of pairs 
such that ei contains v together with a vertex from Vj. Clearly, given 1^ we have d v j =0 for all v G Vj, 

J 6 [4 

It seems reasonable to expect that the d v j are asymptotically independent Poisson random variables. 
To state this precisely, consider a family (b v j)veV,je{o,i,...,k} °f independent Poisson random variables 
with means 

aid 

Let Bq be the event that 

i. for any v G Vo there exist i,j£ [fc], i 7^ j such that 6„j = and 6„j < 1 and 

ii. for any 1 < i < j < k and any v G we have > 1. 



The key step in the proof (somewhat reminiscent of the Poisson cloning model [25]) is to establish the 
following. 
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Lemma 6.3 We have P [C C |I C ] < exp(o(n)) • P [B, 



CI- 



To prove Lemma 1531 we consider a further event. Set By = J2 v <ev- ^vj for hj € {0, 1, ... , fc}, 
7^ (0,0). Being sums of independent Poisson variables, the random variables By are Poisson as 
well, with means 

E[By] = E [Bji] = a iaj dnl{l - F c ) {0<i<j< fc). 

In addition, let Boo be a random variable that is independent of all of the above such that ^Boo has 
distribution Po(a§m/(l — F^)). (In particular, Boo takes even values only.) Now, let V be the event 
that 



i. B, 



ii. ^Bqo 



, ld — B.^ for all ij^j and 

£o<j<j<fe B%j = Tn. 
Lemma 6.4 We have P [V] = exp(o(n)). 
Proof. Since 

Bqq 
2 



E 



E ^ 

l<i<j<k 



dn 



2(1 -F(0) 



E 1 



there exist integers /3y = E[By] + 0(1) such that /3y = ^ and |/3 o + X)o<i<j<fc -% 
P [V] > P [By = /?y for all i, j] = J[ P [By = /?y] . 



to. Clearly, 

(48) 



Since /3y = E [By] + O(l) and By is a Poisson variable, Stirling's formula yields P [By 
nin- 1 / 2 ). Therefore, g8) implies P [V] > f2(n-( fc+1 ) 2 / 2 ) = exp(o(n)), as claimed. 

Proof of Lemma \6J\ Let m = (my)jy e {o,i,...,fe} be a family of non-negative integers such that 

a. TOy = rriji for all i, j, 

b. ma = for i <G [fc] and 

c m 00 + T, <i<j<k m v = m - 
Let M m be the event that 

d V Q = 2moo and d v j = my for any < i < j < k. 



□ 



Analogously, let Ai' m be the event that 

Boo = 2?7ioo and By = TOy for any < i < j < k. 
We claim that for any m that satisfies a.-c. above we have 

P[C ( \M m ] = P[Bc\M' m }. 



(49) 



Indeed, let either i = j = or < i < j < k. Given that M. m occurs, we can think of the TOy edges 
that join Vi and Vj as balls and of the vertices v £ Vi as bins. Each ball is tossed into one of the bins 
randomly and independently, and these events are independent for all i,j. Thus, d49b simply follows from 
the Poissonization of the balls and bins experiment (Lemma [2. 3 1 ). 

To complete the proof, we need to compare P [.M m |Zf] andP [.M m |V]. Because under the distribution 
P [ • |I,j] the pairs ei, . . . , e m are simply chosen randomly subject to the constraint that none of them joins 
two vertices in the same class Vi, i £ [fc], we see that 



P [.Mm ft] 



moolllo 



<i<j<k mi 3- 



I -Ft 



n 

0<i<j<k 



(50) 
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(The factor of 2 arises because e\, . . . , e rn are ordered pairs.) Furthermore, because V provides that By = 
Bji for all we have 

prjM , |v] P [Bqq = 2m 00 ] ■ IIo<i<j<fc P [gg = w d 

[ ml J P[V] 

Thus, by Lemma l6~4l 

P[M' m \V] = cxp( (n)) • P [Boo = 2 ? 7i 00 ] • J[ p [By = my] . (51) 

0<i<i<fc 

Since for < i < j < k the random variables By are Poisson with mean onajdn/ (1 — ), we have 

p [B _ ] _ (giajdn/(l - gcg _ f g^j \ ? "' J ^ (52) 

y y niy! exp(a,aydn/(l — i^)) \1— -Fc/ TO u- exp(2a l a :) m/(l — F^)) ' 

Similarly, 



(ggm/(l-F c )r°° = N m °° 

m o!cxp(agm/(l - F c )) \ 1 - F c 7 m 00 ! exp(agm/(l - F c )) ' 



P [B 00 = 2 W00 ] = - , > Evu = ( T~z?7 ) ~ .^.L^ ^ - (53) 



Combining (l50ll-(l53Tl, we obtain from Stirling's formula 

P[7W m |X c ] m! exp(m(l - F^jal + 2 Eo< i<J < fc "."j)) 

p [-A^m|V] cxp(o(n))m m 

m!exp(m + o(n)) , , 

= — = cxp o i • (54) 

Finally, combining (l49l and ( T54b we conclude that for any m that satisfies a.-c. we have 

P{C ( r\M m \l ( ] = P[C c \M m ]-P[M m \T C } [asM m cZ c ] 
= exp(o(n))P[B c \M' m ]-P[M' m \V] 
= exp(o(n))P[B c nM' m ]/P[V] 

= exp(o(n)) P [B( n M' m ] [due to Lemma l6~4l. 

Summing over all possible m completes the proof. □ 
Proof of Lemma EOl We are going to bound the probability of the event B^. For v E Vq we have 

P [31 < i < j < k : b m = b vj = 0] < 51 P [6rf = 6^ = 0] + ^ P [b m = 0, 6^ = 1] = Po , 

l<Kj<k i,je[k]:ijtj 

because the b V i, b V j are independent Poisson variables. Similarly, if v E Vi for some i G [k], then 

P [V? G [k] \ {i} : b V3 > 1] = J] l-P[b vj <l]= Pi . 

ie[fc]\{i} 

Due to the mutual independence of the 6 t ,j, we thus obtain P ] = p^ 071 Yli=i P? in - Finally, the assertion 
follows from Lemma 1531 □ 
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